Reputation Extraction Using Both Structural and Content Information
نویسندگان
چکیده
We propose a new method of extracting texts related to a given keyword from Web pages collected by a search engine. By combining structural pattern matching and text classification, texts related to a given keyword such as reputations of a given restaurant can be extracted automatically from Web pages in unfixed sites, which is impossible by conventional wrappers. According to our cross validation results on extracting reputations of a given Ramen shop from Web pages collected by a search engine, our method achieved 79.3% precision and 56.6% recall by allowing acceptable errors.
منابع مشابه
Analyzing the Structural and Outward Features and Expected Activities of a Service-Extension Agricultural Website in Iran
Background and Aim: This study aimed to identify and Analyzing the structural and outward features and expected activities of a service-extension agricultural website, based on the views of experts in the field of agriculture and related sciences and webmasters and blogs in Iran. Method: The methodological approach was a descriptive and survey study. The statistical population of the study cons...
متن کاملIdentifying Credibility Criteria in Scholarly Communication (Reading and Citing) form the Standpoints of Faculty Members of Kharazmi University
Background and Aim: In effect, every scientific endeavor consisted of scientific communication and scientists’ involvement in particular field of study; and scientific board members as the most outstanding elements play a key role in scientific productions. Therefore, a constructive scientific communication requires obtaining credible and valid information. In so doing, this study tries to inve...
متن کاملStructural Model of Brand Ambidexterity Impact on Brand Commitment through Brand’s Performance, Image and Reputation
Brand ambidexterity strategies help organizations improve their capabilities and performance and simultaneously discover new opportunities. The purpose of this study is to investigate the effects of brand ambidexterity strategies on brand commitment through brand’s performance, image and reputation. The statistical population of this research were the users of Pishgaman Company. Random sampling...
متن کاملContent Contribution for Revenue Sharing and Reputation in Social Media: A Dynamic Structural Model
This study examines the incentives for content contribution in social media. we propose that exposure and reputation are the major incentives for contributors. Besides, as more and more social media web sites offer advertising-revenue sharing with some of their contributors, shared revenue provides an extra incentive for contributors who have joined revenue-sharing programs. we develop a dynami...
متن کاملData Extraction using Content-Based Handles
In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired by the way a human user scans the page content for specific data. In particular, we use text fea...
متن کامل